Simple Baseline for Visual Question Answering

Exploring Models and Data for Image Question Answering

Visual7W: Grounded Question Answering in Images

Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction